AITopics | energy cost

Collaborating Authors

energy cost

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

A faster way to estimate AI power consumption

AIHubMay-19-2026, 12:59:13 GMT

Due to the explosive growth of artificial intelligence, it is estimated that data centers will consume up to 12 percent of total U.S. electricity by 2028, according to the Lawrence Berkeley National Laboratory. Improving data center energy efficiency is one way scientists are striving to make AI more sustainable. Toward that goal, researchers from MIT and the MIT-IBM Watson AI Lab developed a rapid prediction tool that tells data center operators how much power will be consumed by running a particular AI workload on a certain processor or AI accelerator chip. Their method produces reliable power estimates in a few seconds, unlike traditional modeling techniques that can take hours or even days to yield results. Moreover, their prediction tool can be applied to a wide range of hardware configurations -- even emerging designs that haven't been deployed yet.

artificial intelligence, natural language, question answering, (17 more...)

AIHub

Industry: Information Technology > Services (1.00)

Technology: Information Technology > Artificial Intelligence > Natural Language > Question Answering (0.37)

Add feedback

AI could put people off tech jobs and hurt the economy, warns Raspberry Pi boss

BBC NewsMay-14-2026, 23:01:20 GMT

The founder of British computer maker Raspberry Pi has warned that overestimating the abilities of Artificial Intelligence (AI) could put people off pursuing tech jobs and hurt the economy. Eben Upton told the BBC's Big Boss Interview podcast this could distort people's choices in ways that make that skill shortage worse and not better. Some people are very inclined to overestimate what these [AI] tools can do, he said, and warned against claims that it would destroy vast numbers of computing roles over the coming years. The rise of tools such as ChatGPT and Claude have led to predictions of huge job losses, particularly for tech workers and graduates. Amazon, Meta and Microsoft have already blamed tens of thousands of layoffs on AI over the last year.

artificial intelligence, chatbot, natural language, (6 more...)

BBC News

Country:

North America (1.00)
Europe > United Kingdom (1.00)

Industry:

Leisure & Entertainment > Sports (0.44)
Banking & Finance > Economy (0.36)
Media > Film (0.30)

Technology:

Information Technology > Communications > Mobile (0.37)
Information Technology > Artificial Intelligence > Natural Language > Chatbot (0.36)

Add feedback

Joint Energy Management and Coordinated AIGC Workload Scheduling for Distributed Data Centers: A Diffusion-Aided Reward Shaping Approach

Fu, Yang, Qin, Peng, Chen, Liming, Zhang, Zihao, Yu, Hao, Wang, Yifei

arXiv.org Machine LearningMay-6-2026

Artificial intelligence-generated content (AIGC) has emerged as a transformative paradigm for automating the creation of diverse and customized content, giving rise to rapidly growing computational workloads in cloud data centers. It is imperative for AIGC service providers (ASPs) to strategically schedule AIGC workloads to reduce data center energy costs while guaranteeing high-quality content generation. However, the distinctive characteristics of AIGC services pose critical challenges, including model heterogeneity across ASPs, implicit service quality evaluation, and complex inference process control. To tackle these challenges, we propose a joint energy management and coordinated AIGC workload scheduling framework, which introduces an explicit mathematical characterization of service quality to promote both job transfer among ASPs and fine-grained inference process configuration. Moreover, various energy resources within data centers are jointly considered to enhance power usage flexibility. Subsequently, a system utility maximization problem is formulated to balance AIGC service revenue with operational penalties and costs. Nevertheless, the strong coupling among job scheduling decisions induces severe reward sparsity, which limits the effectiveness of existing deep reinforcement learning (DRL) algorithms. To address this issue, we develop a diffusion model-aided reward shaping approach to synthesize complementary reward signals through a multi-step denoising process. This approach is seamlessly integrated with DRL to enable efficient learning of scheduling policies under sparse environmental feedback. Experiments based on real-world models and datasets demonstrate that our scheme effectively accommodates electricity price fluctuations and AIGC model heterogeneity, while achieving superior learning convergence and system utility compared with benchmark methods.

cloud computing, machine learning, reinforcement learning, (22 more...)

arXiv.org Machine Learning

2605.02965

Genre: Research Report (0.81)

Industry:

Information Technology (1.00)
Energy > Power Industry (1.00)

Technology:

Information Technology > Cloud Computing (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (0.86)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.68)

Add feedback

E2-Train: Training State-of-the-art CNNs with Over 80% Energy Savings

Yue Wang, Ziyu Jiang, Xiaohan Chen, Pengfei Xu, Yang Zhao, Yingyan Lin, Zhangyang Wang

Neural Information Processing SystemsFeb-12-2026, 10:16:14 GMT

Hence, manyefforts havebeen made towards efficient CNNinference inresource-constrained platforms. The increasing penetration of intelligent sensors has revolutionized how Internet of Things (IoT) works.

artificial intelligence, arxivpreprintarxiv, machine learning, (18 more...)

Neural Information Processing Systems

Country:

North America > United States > Wisconsin > Dane County > Madison (0.04)
North America > United States > Texas > Brazos County > College Station (0.04)
North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)

Industry:

Information Technology (0.48)
Education > Educational Setting (0.46)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Add feedback

NeuralStochasticControl

Neural Information Processing SystemsFeb-8-2026, 10:35:53 GMT

Control problems are always challenging since they arise from the real-world systems where stochasticity and randomness are of ubiquitous presence.

artificial intelligence, arxivpreprintarxiv, machine learning, (19 more...)

Neural Information Processing Systems

Country:

Europe > Germany > North Rhine-Westphalia > Cologne Region > Bonn (0.04)
Asia > China (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.68)

Add feedback

37693cfc748049e45d87b8c7d8b9aacd-Supplemental.pdf

Neural Information Processing SystemsFeb-8-2026, 05:46:40 GMT

adder detector, cnn-based detector, detector, (15 more...)

Neural Information Processing Systems

Technology: Information Technology > Artificial Intelligence > Machine Learning (0.50)

Add feedback

ShiftAddNet: AHardware-InspiredDeepNetwork

Neural Information Processing SystemsFeb-7-2026, 17:05:53 GMT

DNNs arelargely composed ofmultiplication operations for both forward and backward propagation, which are much more computationally costly than addition [1].

artificial intelligence, machine learning, shiftaddnet, (19 more...)

Neural Information Processing Systems

Country:

North America > Canada > British Columbia > Metro Vancouver Regional District > Vancouver (0.04)
Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.04)

Technology: Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)

Add feedback

ELANA: A Simple Energy and Latency Analyzer for LLMs

Chiang, Hung-Yueh, Wang, Bokun, Marculescu, Diana

arXiv.org Artificial IntelligenceDec-12-2025

The latency and power consumption of large language models (LLMs) are major constraints when serving them across a wide spectrum of hardware platforms, from mobile edge devices to cloud GPU clusters. Benchmarking is crucial for optimizing efficiency in both model deployment and next-generation model development. To address this need, we open-source a simple profiling tool, \textbf{ELANA}, for evaluating LLMs. ELANA is designed as a lightweight, academic-friendly profiler for analyzing model size, key-value (KV) cache size, prefilling latency (Time-to-first-token, TTFT), generation latency (Time-per-output-token, TPOT), and end-to-end latency (Time-to-last-token, TTLT) of LLMs on both multi-GPU and edge GPU platforms. It supports all publicly available models on Hugging Face and offers a simple command-line interface, along with optional energy consumption logging. Moreover, ELANA is fully compatible with popular Hugging Face APIs and can be easily customized or adapted to compressed or low bit-width models, making it ideal for research on efficient LLMs or for small-scale proof-of-concept studies. We release the ELANA profiling tool at: https://github.com/enyac-group/Elana.

large language model, latency, machine learning, (18 more...)

arXiv.org Artificial Intelligence

2512.09946

Country: North America > United States > Texas (0.14)

Genre: Research Report (0.40)

Industry: Energy (0.36)

Technology:

Information Technology > Artificial Intelligence > Natural Language > Large Language Model (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.33)

Add feedback

TokenPowerBench: Benchmarking the Power Consumption of LLM Inference

Niu, Chenxu, Zhang, Wei, Li, Jie, Zhao, Yongjian, Wang, Tongyang, Wang, Xi, Chen, Yong

arXiv.org Artificial IntelligenceDec-3-2025

Large language model (LLM) services now answer billions of queries per day, and industry reports show that inference, not training, accounts for more than 90% of total power consumption. However, existing benchmarks focus on either training/fine-tuning or performance of inference and provide little support for power consumption measurement and analysis of inference. We introduce TokenPowerBench, the first lightweight and extensible benchmark designed for LLM-inference power consumption studies. The benchmark combines (i) a declarative configuration interface covering model choice, prompt set, and inference engine, (ii) a measurement layer that captures GPU-, node-, and system-level power without specialized power meters, and (iii) a phase-aligned metrics pipeline that attributes energy to the prefill and decode stages of every request. These elements make it straight-forward to explore the power consumed by an LLM inference run; furthermore, by varying batch size, context length, parallelism strategy and quantization, users can quickly assess how each setting affects joules per token and other energy-efficiency metrics. We evaluate TokenPowerBench on four of the most widely used model series (Llama, Falcon, Qwen, and Mistral). Our experiments cover from 1 billion parameters up to the frontier-scale Llama3-405B model. Furthermore, we release TokenPowerBench as open source to help users to measure power consumption, forecast operating expenses, and meet sustainability targets when deploying LLM services.

large language model, machine learning, natural language, (20 more...)

arXiv.org Artificial Intelligence

2512.03024

Genre: Research Report (0.64)

Industry: